Abstract

The Komlós conjecture in discrepancy theory states that for some
constant $K$ and for any $m \times n$ matrix $A$ whose columns lie in the
unit ball there exists a vector $x \in \{-1, +1\}^n$ such that
$\|Ax\|_\infty \le K$. This conjecture also implies the Beck-Fiala
conjecture on the discrepancy of bounded degree hypergraphs. Here we prove
a natural relaxation of the Komlós conjecture: if the columns of $A$ are
assigned unit vectors in $\mathbb{R}^n$ rather than $\pm 1$ then the Komlós
conjecture holds with $K = 1$. Our result rules out the possibility of a
counterexample to the conjecture based on semidefinite programming. It also
opens the way to proving tighter efficient (polynomial-time computable)
upper bounds for the conjecture using semidefinite programming techniques.



1 Introduction 

Let $\mathcal{H} = \{H_1, \ldots, H_m\}$ be a hypergraph with vertex set
$V = [n]$. In this work we study the combinatorial discrepancy of
hypergraphs and related quantities. The discrepancy of $\mathcal{H}$ is
defined as

$$\mathrm{disc}(\mathcal{H}) = \min_{\chi : [n] \to \{-1, +1\}} \max_{i = 1}^{m} \Big| \sum_{j \in H_i} \chi(j) \Big|.$$

Intuitively, discrepancy is the optimization problem of coloring the
vertices of a hypergraph so that the most imbalanced edge is as balanced as
possible. Thus discrepancy is intimately connected to problems in Ramsey
theory that study conditions under which every coloring leaves some edge
monochromatic. Discrepancy has applications in geometry, computer science,
and numerical integration, among others; the books by Matoušek [10] and
Chazelle [6], and the chapter by Beck and Sós [5], provide references for a
wide array of applications.
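
To make the definition concrete, the following is a minimal brute-force sketch in Python. It is not part of the argument in this paper; the function name `disc`, the 0-indexed vertex labels, and the two toy hypergraphs are assumptions made purely for illustration, and the exhaustive search over all $2^n$ colorings is only feasible for very small $n$.

```python
from itertools import product

def disc(edges, n):
    """Brute-force disc(H) for a hypergraph on vertices 0..n-1.

    Tries all 2^n colorings chi and returns the smallest achievable
    value of the maximum over edges of |sum of chi over the edge|.
    """
    best = None
    for chi in product((-1, 1), repeat=n):
        worst = max(abs(sum(chi[j] for j in edge)) for edge in edges)
        if best is None or worst < best:
            best = worst
    return best

# Hypothetical toy instances (0-indexed vertices), for illustration only.
path_like = [{0, 1}, {1, 2}, {2, 3}, {0, 1, 2, 3}]
triangle = [{0, 1}, {1, 2}, {0, 2}]
print(disc(path_like, 4))  # alternating colors balance every edge
print(disc(triangle, 3))   # two of the three vertices must share a color
```

In the first instance an alternating coloring balances every edge, so the discrepancy is 0; in the triangle some edge is always monochromatic, so the discrepancy is 2.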

We will be particularly interested in the discrepancy of hypergraphs with
maximum degree bounded above by a parameter $t$, i.e. hypergraphs
$\mathcal{H}$ all of whose vertices appear in at most $t$ edges. It is a
classical result of Beck and Fiala [4] that for any $\mathcal{H}$ of maximum
degree at most $t$, $\mathrm{disc}(\mathcal{H}) \le 2t - 1$. Furthermore,
they conjectured that $\mathrm{disc}(\mathcal{H}) \le C\sqrt{t}$ for an
absolute constant $C$. Proving Beck and Fiala's conjecture remains an
elusive open problem in discrepancy theory.

As usual, we define the incidence matrix of $\mathcal{H}$ as an $m \times n$
0-1 matrix $A$ such that $A_{ij} = 1$ if and only if $j \in H_i$. In matrix
notation discrepancy can be defined as
$\mathrm{disc}(\mathcal{H}) = \min_{x \in \{-1,1\}^n} \|Ax\|_\infty$. This
algebraic formulation allows us to extend the definition of discrepancy to
arbitrary matrices:
$\mathrm{disc}(A) = \min_{x \in \{-1,1\}^n} \|Ax\|_\infty$. Interpreted in
this way, discrepancy is a vector balancing problem: our goal is to assign
signs to a given set of $n$ vectors (the columns of $A$), so that the signed
sum has small norm (infinity norm in our case). A natural restriction on
$A$, analogous to the maximum degree restriction for hypergraphs, is to
bound the maximum of some norm of the columns of $A$. Such vector balancing
problems were first considered in a general form by Bárány and Grinberg [3],
although a similar problem was posed as early as 1963 by Dworetzky. The
proof of Beck and Fiala shows that for any $A$ whose columns have $\ell_1$
norm at most 1, $\mathrm{disc}(A) < 2$. Komlós conjectured$^{1}$ that for
$A$ whose columns have $\ell_2$ norm at most 1, $\mathrm{disc}(A) \le K$ for
some absolute constant $K$. The Komlós conjecture implies the Beck-Fiala
conjecture and also remains open. The best partial progress towards proving
the Komlós conjecture is a result by Banaszczyk [1], who showed the bound
$\mathrm{disc}(A) \le K\sqrt{\log n}$ for an absolute constant $K$. This is
the best known bound for the Beck-Fiala conjecture as well.

In this paper we are concerned with a natural convex relaxation of
discrepancy: vector discrepancy. Vector discrepancy is defined analogously
to discrepancy, but we "color" $[n]$ with unit $n$-dimensional vectors
rather than $\pm 1$:

$$\mathrm{vecdisc}(A) = \min_{u_1, \ldots, u_n \in S^{n-1}} \max_{i = 1}^{m} \Big\| \sum_{j=1}^{n} A_{ij} u_j \Big\|_2,$$

where $S^{n-1}$ is the unit sphere in $\mathbb{R}^n$. Vector discrepancy is
a relaxation of discrepancy, i.e. $\mathrm{vecdisc}(A) \le \mathrm{disc}(A)$
for all matrices $A$: a coloring $x$ achieving $\mathrm{disc}(A)$ induces a
vector coloring $\{u_i = x_i e_1\}_{i=1}^{n}$ ($e_1$ being the first
standard basis vector) achieving vector discrepancy with the same value.
Vector discrepancy was used by Lovász to give an alternative proof of Roth's
lower bound on the discrepancy of arithmetic progressions [7]. A natural
question is whether a lower bound on vector discrepancy could disprove the
Komlós conjecture. Our main result is a negative answer to this question.
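
The relaxation argument above can be checked numerically. The sketch below is illustrative only: the helper names `sign_value`, `vector_value`, and `induced_coloring`, and both example matrices, are assumptions introduced here and do not appear in the paper. It verifies that the induced coloring $u_j = x_j e_1$ attains exactly $\|Ax\|_\infty$, and shows one case where a genuinely vector-valued coloring does strictly better than any sign coloring.

```python
import math
from itertools import product

def sign_value(A, x):
    """||Ax||_inf for a sign vector x."""
    return max(abs(sum(a * s for a, s in zip(row, x))) for row in A)

def vector_value(A, U):
    """max_i ||sum_j A[i][j] * U[j]||_2, where U[j] is the vector for column j."""
    dim = len(U[0])
    worst = 0.0
    for row in A:
        combo = [sum(a * u[c] for a, u in zip(row, U)) for c in range(dim)]
        worst = max(worst, math.sqrt(sum(t * t for t in combo)))
    return worst

def induced_coloring(x):
    """The vector coloring u_j = x_j * e_1 in R^n induced by a sign vector x."""
    n = len(x)
    return [[float(s)] + [0.0] * (n - 1) for s in x]

# Hypothetical 2 x 3 example matrix, chosen for illustration.
A = [[1.0, 1.0, 1.0],
     [1.0, -1.0, 0.0]]

# Every sign coloring and its induced vector coloring have the same value.
for x in product((-1, 1), repeat=3):
    assert abs(sign_value(A, x) - vector_value(A, induced_coloring(x))) < 1e-12

# For the single row (1, 1, 1), three unit vectors at 120 degrees sum to
# zero, while every sign coloring leaves an odd, hence nonzero, sum.
row = [[1.0, 1.0, 1.0]]
s = math.sqrt(3.0) / 2.0
U = [[1.0, 0.0, 0.0], [-0.5, s, 0.0], [-0.5, -s, 0.0]]
best_sign = min(sign_value(row, x) for x in product((-1, 1), repeat=3))
print(best_sign, vector_value(row, U))
```

The second example has columns of norm exactly 1, so it is consistent with the bound of Theorem 1 while showing that vector discrepancy can be strictly smaller than discrepancy.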

Theorem 1. For any $m \times n$ real matrix $A$ whose columns have $\ell_2$
norm at most 1, $\mathrm{vecdisc}(A) \le 1$.

$^{1}$ The earliest reference we can find is the 1987 book 'Ten Lectures on
the Probabilistic Method' by Spencer [12].
Beyond serving as a means to lower bound discrepancy, vector discrepancy has
also recently proved useful in establishing efficient upper bounds on
discrepancy. In a recent breakthrough, Bansal [2] showed the following
theorem.

Theorem 2 ([2]). Let $A$ be a real $m \times n$ matrix and assume that for
any submatrix $B$ of $A$ we have $\mathrm{vecdisc}(B) \le D$. Then
$\mathrm{disc}(A) \le D \cdot K \log m$, and, furthermore, there exists a
polynomial time randomized algorithm which on input $A$ outputs
$x \in \{-1, 1\}^n$ such that, with high probability,
$\|Ax\|_\infty \le D \cdot K \log m$ for an absolute constant $K$. □

In light of Bansal's result, Theorem 1 implies that for any $A$ whose
columns lie in the unit ball $\mathrm{disc}(A) \le K \log m$ and that a
coloring $x$ achieving this bound can be found in randomized polynomial
time. Such an efficient upper bound for the Komlós conjecture was proved by
Bansal [2], and later using different methods by Lovett and Meka [9].
However, Bansal's, and Lovett and Meka's upper bounds are based on the
"partial coloring" method and a $\log n$ factor seems inherent to upper
bounds for the Komlós conjecture derived using this method. On the other
hand, Matoušek [11] conjectures that the $\log m$ factor in Theorem 2 can be
improved to $\sqrt{\log m}$. If this conjecture holds, we would have an
alternative and efficient proof of Banaszczyk's upper bound. We note that
Banaszczyk's proof does not obviously yield an efficient algorithm, and no
polynomial time algorithm that matches his bound is currently known.

To the best of our knowledge, Theorem 1 establishes the first constant upper
bound on the vector discrepancy of matrices with bounded column $\ell_2$
norms and on the vector discrepancy of bounded degree hypergraphs. A weaker
bound of $O(\sqrt{\log m})$ can be derived in a variety of ways: directly
from Banaszczyk's upper bound; from the existence of constant discrepancy
partial colorings for the Komlós conjecture; from Matoušek's recent upper
bound [11] on vector discrepancy in terms of the determinant lower bound of
Lovász, Spencer, and Vesztergombi [8]. Our bound is tight, as
$\mathrm{vecdisc}((1)) = 1$, for example.

Techniques. Our proof of Theorem 1 relies on a dual characterization of
vector discrepancy, first used by Matoušek to show that the determinant
lower bound on discrepancy is almost tight [11]. However, our result does
not follow directly from Matoušek's techniques, which only imply a bound of
$O(\sqrt{\log m})$. Vector discrepancy is equivalent to a semidefinite
programming problem, and, using a variant of the Farkas lemma for
semidefinite programming, we can formulate a dual program which is feasible
for a parameter $D$ precisely when $\mathrm{vecdisc}(A) \ge D$. We assume
that the dual program is feasible for $D = 1 + \epsilon$. Geometrically,
this feasibility can be formulated as the existence of two ellipsoids $E$
and $F$ such that $F \subseteq E$ and the sum of squared axes lengths of $E$
is at most a $D$ factor larger than the sum of squared axes lengths of $F$.
The containment $F \subseteq E$ and the fact that the columns of $A$ lie
inside the unit ball imply that the largest $k$-dimensional section of $E$
has volume lower bounded by the largest $k$-dimensional section of $F$, for
all $k$. Therefore, the axes lengths of $E$ multiplicatively majorize the
axes lengths of $F$, and Schur convexity implies a contradiction to the
assumed constraints on the axes lengths of $E$ and $F$.
2 Preliminaries

In this section we introduce some basic notation and useful linear algebraic facts.

2.1 Notation

We use boldface to denote matrices: $A$, $X$. We denote the entry in the
$i$-th row and $j$-th column of $A$ as $A_{ij}$. We denote by
$\mathrm{range}(A)$ the vector space spanned by the columns of $A$, and by
$\ker(A)$ the kernel (nullspace) of $A$. We assume a generic matrix $A$ has
dimensions $m$ by $n$. By $\|\cdot\|$ we denote the standard $\ell_2$ norm.

For a real symmetric matrix $X$, we use $X \succeq 0$ to denote that $X$ is
positive semidefinite.

For a real $m$ by $n$ matrix $A$, we define the discrepancy of $A$ as

$$\mathrm{disc}(A) = \min_{x \in \{-1, 1\}^n} \|Ax\|_\infty.$$

We define the vector discrepancy of $A$ as

$$\mathrm{vecdisc}(A) = \min_{u_1, \ldots, u_n \in S^{n-1}} \max_{i = 1}^{m} \Big\| \sum_{j=1}^{n} A_{ij} u_j \Big\|_2,$$

where $S^{n-1}$ is the $(n-1)$-dimensional unit sphere in $\mathbb{R}^n$. As
noted earlier, $\mathrm{vecdisc}(A) \le \mathrm{disc}(A)$ for all $A$.

2.2 Dual Characterization of Vector Discrepancy

For each matrix $A$, $\mathrm{vecdisc}(A)$ is defined as the minimum value
of a convex function over a convex set, i.e. as the value of a convex
optimization problem. In particular, $\mathrm{vecdisc}(A)^2$ can be written
as the optimal value of the semidefinite program

    $\min D$                                                      (1)
    subject to                                                    (2)
    $\forall\, 1 \le i \le m : (A X A^T)_{ii} \le D$              (3)
    $\forall\, 1 \le i \le n : X_{ii} = 1$                        (4)
    $X \succeq 0.$                                                (5)

To see the equivalence, write the vectors $u_1, \ldots, u_n$ forming a
vector coloring as the columns of the matrix $U$ and set
$X = U^T U \succeq 0$. Also, by the Cholesky decomposition of positive
semidefinite matrices, any $X \succeq 0$ satisfying (4) can be written as
$X = U^T U$ where the columns of $U$ are unit vectors and therefore give a
vector coloring.

Using strong duality for convex programming, we can derive the dual program
to (1)-(5) and characterize $\mathrm{vecdisc}(A)^2$ as the optimal (maximum)
value of this dual. A derivation of the dual appears in recent work by
Matoušek [11]. Next we present the resulting characterization of vector
discrepancy. For a detailed proof of Theorem 3 see [11].

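Theorem 3. For any real $m \times n$ matrix $A$, $\mathrm{vecdisc}(A) \ge D$
if and only if there exists a distribution $p$ over $[m]$ and a vector
$w \in \mathbb{R}^n$ satisfying

$$\sum_{j=1}^{n} w_j \ge D^2$$ (6)

such that for all $z \in \mathbb{R}^n$

$$\sum_{i=1}^{m} p_i \Big( \sum_{j=1}^{n} A_{ij} z_j \Big)^2 \ge \sum_{j=1}^{n} w_j z_j^2.$$ (7)

□

We note a geometric interpretation of Theorem 3. Define the ellipsoids
$F(p, A) = \{z : \sum_{i=1}^{m} p_i (\sum_{j=1}^{n} A_{ij} z_j)^2 \le 1\}$
and $E(w) = \{z : \sum_{j=1}^{n} w_j z_j^2 \le 1\}$. By (7), every $z$ in
$F(p, A)$ also lies in $E(w)$. Thus Theorem 3 states that
$\mathrm{vecdisc}(A) \ge D$ if and only if there exist a distribution $p$
and a vector $w$ satisfying (6) such that $F(p, A) \subseteq E(w)$.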

2.3 Linear Algebra 

The following two lemmas are essential to our proof. We suspect they are
standard, but include detailed proofs for completeness. The first lemma
states, geometrically, that any $k$-dimensional section of an ellipsoid $E$
has volume upper bounded by the volume of the section with the subspace
spanned by the $k$ longest axes of $E$. We state and prove the lemma using
elementary linear algebra.

Lemma 1. Let $X \in \mathbb{R}^{n \times n} : X \succeq 0$ be a symmetric
real matrix with eigenvalues $\sigma_1 \ge \cdots \ge \sigma_n \ge 0$. Let
also $U \in \mathbb{R}^{n \times k}$ be a matrix with mutually orthogonal
unit columns. Then $\det(U^T X U) \le \sigma_1 \cdots \sigma_k$.

Proof. Let $U_1$ and $U_2$ be two matrices with mutually orthogonal unit
columns such that $\mathrm{range}(U_1) = \mathrm{range}(U_2)$. We claim that
$\det(U_1^T X U_1) = \det(U_2^T X U_2)$, and, therefore, $\det(U^T X U)$ is
entirely determined by $\mathrm{range}(U)$. Indeed, there exists a unitary
matrix $S$ such that $U_1 S = U_2$, and, therefore,
$\det(U_2^T X U_2) = \det(S^T U_1^T X U_1 S) = \det(U_1^T X U_1) \det(S)^2 = \det(U_1^T X U_1)$.

The lemma is trivially true if $\det(U^T X U) = 0$, so we will assume that
$U^T X U$ is non-singular. The proof of the lemma proceeds by induction on
$k$. In the base case $k = 1$, the matrix $U$ is just a unit vector $u$. By
the min-max characterization of eigenvalues we have that for any unit vector
$u$, $\det(u^T X u) = u^T X u \le \sigma_1$. Let us assume the lemma holds
for $k - 1$. Let $v_1, \ldots, v_n$ be the eigenvectors of $X$ associated
with $\sigma_1, \ldots, \sigma_n$. Since $\mathrm{range}(U)$ is a vector
space of dimension $k$, we have that
$\mathrm{range}(U) \cap \mathrm{span}\{v_2, \ldots, v_n\}$ is a vector space
of dimension at least $k - 1$. Let $U_0 \in \mathbb{R}^{n \times (k-1)}$ be
a matrix whose columns are an orthonormal basis for some
$(k-1)$-dimensional vector space contained in
$\mathrm{range}(U) \cap \mathrm{span}\{v_2, \ldots, v_n\}$. Let $P$ be a
projection matrix for the space $\mathrm{span}\{v_2, \ldots, v_n\}$. Since
the columns of $U_0$ are elements of $\mathrm{span}\{v_2, \ldots, v_n\}$, we
have that $P U_0 = U_0$, and, therefore,
$U_0^T P^T X P U_0 = U_0^T X U_0$. Also, the top $n - 1$ eigenvalues of
$P^T X P$ are $\sigma_2, \ldots, \sigma_n$, and, by the induction
hypothesis, $\det(U_0^T X U_0) \le \sigma_2 \cdots \sigma_k$. Let $u$ be a
unit vector in $\mathrm{range}(U_0)^\perp \cap \mathrm{range}(U)$ and let
$U_1$ be the matrix resulting from appending $u$ as a column of $U_0$. Since
$\mathrm{range}(U) = \mathrm{range}(U_1)$, we have that
$\det(U^T X U) = \det(U_1^T X U_1)$. A direct calculation shows that

$$U_1^T X U_1 = \begin{pmatrix} U_0^T X U_0 & U_0^T X u \\ u^T X U_0 & u^T X u \end{pmatrix}.$$

Therefore,

$$\det(U_1^T X U_1) = (u^T X u) \det\Big( U_0^T X U_0 - \frac{U_0^T X u \, u^T X U_0}{u^T X u} \Big).$$

Here $u^T X u > 0$, since $U^T X U$ is non-singular and $u$ lies in
$\mathrm{range}(U)$. Let $v = \frac{1}{\sqrt{u^T X u}} U_0^T X u$. Since $X$
is symmetric, the equality above reduces to
$\det(U_1^T X U_1) = (u^T X u) \det(U_0^T X U_0 - v v^T)$. Since we assumed
that $U^T X U$ is non-singular, we know that
$\mathrm{range}(U_0) \cap \ker(X) \subseteq \mathrm{range}(U) \cap \ker(X) = \{0\}$,
and, therefore, $U_0^T X U_0$ is also non-singular. By the matrix
determinant lemma,
$\det(U_0^T X U_0 - v v^T) = \det(U_0^T X U_0)(1 - v^T (U_0^T X U_0)^{-1} v)$.
Since $U_0^T X U_0 \succeq 0$, $(U_0^T X U_0)^{-1} \succeq 0$, and therefore
$v^T (U_0^T X U_0)^{-1} v \ge 0$. So we have that
$\det(U_1^T X U_1) \le (u^T X u) \det(U_0^T X U_0)$. By the min-max
characterization of eigenvalues, $u^T X u \le \sigma_1$ and this completes
the inductive step. □
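
As a numerical sanity check of Lemma 1 (not a substitute for the proof), the sketch below samples random 2-dimensional sections of a fixed diagonal matrix. The choice $X = \mathrm{diag}(3, 2, 1)$, the helper names `gram_schmidt_pair` and `det_section`, and the random seed are all assumptions made for illustration; a diagonal $X$ is used only so that its eigenvalues can be read off without an eigenvalue solver.

```python
import math
import random

def gram_schmidt_pair(a, b):
    """Return two orthonormal vectors spanning the same plane as a and b."""
    na = math.sqrt(sum(t * t for t in a))
    u1 = [t / na for t in a]
    proj = sum(s * t for s, t in zip(u1, b))
    r = [t - proj * s for s, t in zip(u1, b)]
    nr = math.sqrt(sum(t * t for t in r))
    u2 = [t / nr for t in r]
    return u1, u2

def det_section(sigma, u1, u2):
    """det(U^T X U) for X = diag(sigma) and U with columns u1, u2."""
    def q(s, t):
        return sum(x * a * b for x, a, b in zip(sigma, s, t))
    return q(u1, u1) * q(u2, u2) - q(u1, u2) ** 2

sigma = [3.0, 2.0, 1.0]            # eigenvalues of the illustrative X
bound = sigma[0] * sigma[1]        # sigma_1 * sigma_2, the Lemma 1 bound for k = 2
rng = random.Random(0)
worst = 0.0
for _ in range(1000):
    a = [rng.gauss(0, 1) for _ in range(3)]
    b = [rng.gauss(0, 1) for _ in range(3)]
    worst = max(worst, det_section(sigma, *gram_schmidt_pair(a, b)))
print(worst <= bound + 1e-9)
```

The bound is attained by the section spanned by the first two coordinate axes, which matches the geometric reading of the lemma.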

Lemma 2. Let $X \in \mathbb{R}^{n \times n} : X \succeq 0$ and
$Y \in \mathbb{R}^{n \times n} : Y \succeq 0$. Suppose that

$$\forall u \in \mathbb{R}^n : u^T X u \ge u^T Y u.$$

Then, $\det(X) \ge \det(Y)$.

Proof. For a symmetric real matrix $M \succeq 0$, define the ellipsoid
$E(M) = \{u : u^T M u \le 1\}$. $E(M)$ is unbounded if and only if $M$ is
singular. Otherwise,

$$\mathrm{vol}(E(M)) = \frac{\mathrm{vol}(B_n)}{\sqrt{\det(M)}},$$ (8)

where $B_n$ is the $n$-dimensional unit ball.

By assumption, $E(X) \subseteq E(Y)$. If $\det(Y) = 0$, the lemma is
trivially true. If $\det(X) = 0$, then $E(X)$ is unbounded and therefore
$E(Y)$ is unbounded, which implies $\det(Y) = 0$. If, on the other hand,
$E(X)$ and $E(Y)$ are bounded, we have that
$\mathrm{vol}(E(X)) \le \mathrm{vol}(E(Y))$, and, by (8),
$\det(X) \ge \det(Y)$, as desired. □
3 Proof of Main Theorem

We begin with an inequality which can be seen as a converse to the geometric
mean-arithmetic mean inequality. The inequality follows from the Schur
convexity of symmetric convex functions; we present a self-contained
elementary proof using a powering trick.

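Lemma 3. Let $x_1 \ge \ldots \ge x_n \ge 0$ and
$y_1 \ge \ldots \ge y_n \ge 0$ be such that

$$\forall k \le n : x_1 \cdots x_k \ge y_1 \cdots y_k.$$ (9)

Then,

$$\forall k \le n : x_1 + \ldots + x_k \ge y_1 + \ldots + y_k.$$ (10)

Proof. It suffices to prove (10) for $k = n$, since for any $k \le n$ the
prefixes $x_1, \ldots, x_k$ and $y_1, \ldots, y_k$ again satisfy the
hypotheses of the lemma. We will show that for all positive integers $L$,
$(x_1 + \ldots + x_n)^L \ge \frac{1}{n!}(y_1 + \ldots + y_n)^L$. Taking
$L$-th roots, we get that
$x_1 + \ldots + x_n \ge \frac{1}{(n!)^{1/L}}(y_1 + \ldots + y_n)$. Letting
$L \to \infty$ and taking limits yields the desired result.

By the multinomial theorem,

$$(x_1 + \ldots + x_n)^L = \sum_{i_1 + \ldots + i_n = L} \frac{L!}{i_1! \cdots i_n!} x_1^{i_1} \cdots x_n^{i_n}.$$

The inequalities (9) imply that whenever $i_1 \ge \ldots \ge i_n$,
$x_1^{i_1} \cdots x_n^{i_n} \ge y_1^{i_1} \cdots y_n^{i_n}$. Therefore,

$$(x_1 + \ldots + x_n)^L \ge \sum_{\substack{i_1 \ge \ldots \ge i_n \\ i_1 + \ldots + i_n = L}} \frac{L!}{i_1! \cdots i_n!} y_1^{i_1} \cdots y_n^{i_n}.$$ (11)

Given a sequence $i_1, \ldots, i_n$, let $\sigma$ be a permutation on $n$
elements such that $i_{\sigma(1)} \ge \ldots \ge i_{\sigma(n)}$. Since
$y_1 \ge \ldots \ge y_n$, we have that
$y_1^{i_{\sigma(1)}} \cdots y_n^{i_{\sigma(n)}} \ge y_1^{i_1} \cdots y_n^{i_n}$.
Furthermore, there are at most $n!$ distinct permutations of
$i_1, \ldots, i_n$ (the bound is achieved exactly when all
$i_1, \ldots, i_n$ are distinct). These observations and the multinomial
theorem imply that

$$(y_1 + \ldots + y_n)^L \le n! \sum_{\substack{i_1 \ge \ldots \ge i_n \\ i_1 + \ldots + i_n = L}} \frac{L!}{i_1! \cdots i_n!} y_1^{i_1} \cdots y_n^{i_n}.$$ (12)

Inequalities (11) and (12) together imply
$(x_1 + \ldots + x_n)^L \ge \frac{1}{n!}(y_1 + \ldots + y_n)^L$ as desired. □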
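
The statement of Lemma 3 is easy to test numerically. The following sketch is an illustration only, with hypothetical helper names (`prefix_products`, `prefix_sums`, `dominates`) and arbitrarily chosen sample data: it checks one hand-picked pair of sequences and then counts violations among random sorted sequences that satisfy hypothesis (9). A small tolerance is used in the conclusion to absorb floating-point rounding.

```python
import random
from itertools import accumulate
from operator import mul

def prefix_products(v):
    return list(accumulate(v, mul))

def prefix_sums(v):
    return list(accumulate(v))

def dominates(a, b, tol=0.0):
    """True if a[k] >= b[k] - tol for every index k."""
    return all(s >= t - tol for s, t in zip(a, b))

# Hand-picked example: prefix products (4, 4, 2) dominate (2, 4, 2),
# and prefix sums (4, 5, 5.5) dominate (2, 4, 4.5).
x = [4.0, 1.0, 0.5]
y = [2.0, 2.0, 0.5]
assert dominates(prefix_products(x), prefix_products(y))
assert dominates(prefix_sums(x), prefix_sums(y))

# Random check: whenever hypothesis (9) holds, conclusion (10) should too.
rng = random.Random(1)
checked = violations = 0
for _ in range(5000):
    x = sorted((rng.uniform(0.1, 3.0) for _ in range(4)), reverse=True)
    y = sorted((rng.uniform(0.1, 3.0) for _ in range(4)), reverse=True)
    if dominates(prefix_products(x), prefix_products(y)):
        checked += 1
        if not dominates(prefix_sums(x), prefix_sums(y), tol=1e-9):
            violations += 1
print(checked, violations)
```

The converse direction fails in general (sums can dominate without products dominating), which is why the lemma is applied only in the direction from (9) to (10).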

We are now ready to prove our main result. 

Theorem 4 (Theorem 1 restated). For any matrix
$A \in \mathbb{R}^{m \times n}$ such that
$\forall i \in [n] : \|A_{*i}\| \le 1$, $\mathrm{vecdisc}(A) \le 1$.

Proof. We will use Theorem 3 with $D = \sqrt{1 + \epsilon}$ for an arbitrary
$\epsilon > 0$. For any distribution $p$ over $[m]$ and any
$w \in \mathbb{R}^n$ satisfying $\sum_{i=1}^{n} w_i \ge 1 + \epsilon$ we
will show there exists a $z \in \mathbb{R}^n$ satisfying

$$\sum_{i=1}^{m} p_i \Big( \sum_{j=1}^{n} A_{ij} z_j \Big)^2 < \sum_{j=1}^{n} w_j z_j^2.$$ (13)

Therefore, by Theorem 3, $\mathrm{vecdisc}(A)^2 < 1 + \epsilon$ for all
$\epsilon > 0$, which proves our main theorem.

For any $i$ with $w_i \le 0$, we can set $z_i = 0$. Then
$\sum_{i : w_i > 0} w_i \ge \sum_i w_i \ge 1 + \epsilon$. Consider also the
submatrix $A'$ consisting of those columns $A_{*i}$ of $A$ for which
$w_i > 0$. The matrix $A'$ satisfies the assumption that all its columns
have norm bounded by 1. Therefore, it is sufficient to show that for any
matrix $A$ with columns bounded by 1 in the Euclidean norm, any $w$ such
that $\forall i : w_i > 0$ and $\sum_{i=1}^{n} w_i \ge 1 + \epsilon$, and
any distribution $p$ on $[m]$, there exists a $z$ satisfying (13).

We denote by $W$ the diagonal matrix with $w$ on the diagonal, and similarly
for any distribution $p \in \mathbb{R}^m : \sum_{i=1}^{m} p_i = 1$ we denote
by $P$ the diagonal matrix with $p$ on the diagonal. In this matrix
notation, we need to show that for any positive definite diagonal matrix $W$
such that $\mathrm{Tr}(W) \ge 1 + \epsilon$, and any positive semidefinite
diagonal matrix $P$ such that $\mathrm{Tr}(P) = 1$, there exists a vector
$z \in \mathbb{R}^n$ such that $z^T A^T P A z < z^T W z$.

Assume for contradiction that

$$\forall z : z^T A^T P A z \ge z^T W z.$$ (14)

Geometrically, this is equivalent to $F(p, A) \subseteq E(w)$, where
$F(p, A) = \{z : z^T A^T P A z \le 1\}$ and $E(w) = \{z : z^T W z \le 1\}$.
The outline of our proof is as follows. The relation
$F(p, A) \subseteq E(w)$ implies that, for all $k$, every $k$-dimensional
section of $F$ has volume at most the volume of the corresponding section of
$E$. Using Lemma 1 and the Hadamard bound we can show that this implies
that, for all $k$, the product of the $k$ largest $p_i$ is lower bounded by
the product of the $k$ largest $w_j$. Then, Lemma 3 implies that the sum of
all $p_i$ is lower bounded by the sum of all $w_j$, which is a
contradiction. We proceed to prove the above claims formally.

Let, without loss of generality, $w_1 \ge \ldots \ge w_n > 0$ and similarly
$p_1 \ge \ldots \ge p_m \ge 0$. Denote by $A_{[k]}$ the matrix
$(A_{*1}, \ldots, A_{*k})$ and by $W_k$ the diagonal matrix with
$w_1, \ldots, w_k$ on the diagonal. We first show that

$$\forall k \le n : \det(A_{[k]}^T P A_{[k]}) \le p_1 \cdots p_k.$$ (15)

Let $u_1, \ldots, u_k$ be an orthonormal basis for the range of $A_{[k]}$
and let $U_k$ be the matrix $(u_1, \ldots, u_k)$. Then
$A_{[k]} = U_k U_k^T A_{[k]}$. Each column of the square matrix
$U_k^T A_{[k]}$ has norm at most 1, and, by Hadamard's inequality,
$\det(A_{[k]}^T U_k) = \det(U_k^T A_{[k]}) \le 1$. Therefore,

$$\forall k \le n : \det(A_{[k]}^T P A_{[k]}) \le \det(U_k^T P U_k).$$

By Lemma 1 we have that $\det(U_k^T P U_k) \le p_1 \cdots p_k$, which
proves (15).

By (14) we know that for all $k$ and for all $u \in \mathbb{R}^k$,
$u^T A_{[k]}^T P A_{[k]} u \ge u^T W_k u$, since we can freely choose $z$
such that $z_i = 0$ for all $i > k$. Then, by Lemma 2 we have that

$$\forall k \le n : \det(A_{[k]}^T P A_{[k]}) \ge \det(W_k) = w_1 \cdots w_k.$$ (16)
Combining (15) and (16), we have that

$$\forall k \le n : p_1 \cdots p_k \ge w_1 \cdots w_k.$$ (17)

By Lemma 3, (17) implies that
$1 = \sum_{j=1}^{m} p_j \ge \sum_{j=1}^{n} p_j \ge \sum_{j=1}^{n} w_j \ge 1 + \epsilon$,
a contradiction.

□

4 Conclusion 

We have shown that the vector discrepancy of a matrix $A$ all of whose
columns are contained in the unit ball is bounded by 1 from above. This
result establishes a natural vector discrepancy variant of the notorious
Komlós and Beck-Fiala conjectures. On one hand our result can be seen as
evidence in support of the conjectures: they cannot be disproved by lower
bounding vector discrepancy. On the other hand, our work opens the
possibility of giving an efficient proof of Banaszczyk's bound of
$O(\sqrt{\log m})$ on $\mathrm{disc}(A)$ by improving the
pseudoapproximation algorithm of Bansal [2]. We hope that our result will
prove useful in an attack on the Komlós conjecture itself.